Multi-View Feature Representation for Dialogue Generation with Bidirectional Distillation

نویسندگان

چکیده

Neural dialogue models suffer from low-quality responses when interacted in practice, demonstrating difficulty generalization beyond training data. Recently, knowledge distillation has been used to successfully regularize the student by transferring teacher. However, teacher and are trained on same dataset tend learn similar feature representations, whereas most general should be found through differences. The finding of is further hindered unidirectional distillation, as obey may discard some that truly but refuted To this end, we propose a novel framework, where learning more line with idea reaching consensus, i.e., common beneficial different yet all datasets diversified partners. Concretely, task divided into group subtasks number students. Each assigned one subtask not only optimized allocated also imitates multi-view representation aggregated other students (i.e., peers), which induces capture among alleviates over-fitting subtasks. enhance generalization, extend bidirectional encourages its peers co-evolve exchanging complementary each other. Empirical results analysis demonstrate our framework effectively improves model without sacrificing efficiency.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Data Distillation for Controlling Specificity in Dialogue Generation

People speak at different levels of specificity in different situations.1 A conversational agent should have this ability and know when to be specific and when to be general. We propose an approach that gives a neural network–based conversational agent this ability. Our approach involves alternating between data distillation and model training : removing training examples that are closest to th...

متن کامل

Multi-view Feature Learning with Discriminative Regularization

More and more multi-view data which can capture rich information from heterogeneous features are widely used in real world applications. How to integrate different types of features, and how to learn low dimensional and discriminative information from high dimensional data are two main challenges. To address these challenges, this paper proposes a novel multi-view feature learning framework, wh...

متن کامل

Weighted Multi-view Clustering with Feature Selection

In recent years, combining multiple sources or views of datasets for data clustering has been a popular practice for improving clustering accuracy. As different views are different representations of the same set of instances, we can simultaneously use information from multiple views to improve the clustering results generated by the limited information from a single view. Previous studies main...

متن کامل

On multi-view feature learning

Sparse coding is a common approach to learning local features for object recognition. Recently, there has been an increasing interest in learning features from spatio-temporal, binocular, or other multi-observation data, where the goal is to encode the relationship between images rather than the content of a single image. We provide an analysis of multi-view feature learning, which shows that h...

متن کامل

Generation for Dialogue Translation Using Typed Feature Structure Unification

This a r t i c l e i n t r o d u c e s a b i d i r e c t i o n a l g r a m m a r genera t ion sys tem called fea ture s t ructure-directed generat ion, developed for a d ia logue t rans la t ion sys tem. The sys tem utilizes typed feature structures to control the top-down derivation in a declarative way. This generation system also uses disjunctive feature structures to reduce the number of co...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2021

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v35i14.17516